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Abstract 



Oh 

We derive the asymptotic distribution of the total length L n of a 
Beta(2 — a, a)-coalescent tree for 1 < a < 2, starting from n indi- 
viduals. There are two regimes: If a < \(\ + "v/5), then L n suitably 
rescaled has a stable limit distribution of index a. Otherwise L n just 
£> ■ has to be shifted by a constant (depending on n) to get convergence 

^"} . to a non-degenerate limit distribution. As a consequence we obtain 

qq | the limit distribution of the number S n of segregation sites. 

Keywords and phrases. Beta-coalescent, coupling, point process, sta- 
L^ ! ble distribution. 
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1 Introduction and result 

In this paper we investigate the asymptotic distribution of the suitably nor- 
malized length L n of a n-coalescent of the Beta(2 — a, a)-type with 1 < a < 2. 
As a corollary we obtain the asymptotic distribution of the associated num- 
ber S n of segregating sites, which is the basis of the Watterson estimator 
[18] for the rate 9 of mutation of the DNA. Here we recall that coalescents 
with multiple merging such as Beta-coalescents have been considered in the 
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literature as model for the genealogical relationship within certain maritime 
species [HI E] ■ 

Beta-coalescents (and more generally A-coalescents, as introduced by Pit- 
man [15] and Sagitov [16J) possess a rich underlying partition structure, which 
is nicely presented in detail in N. Berestycki [3]. For our purposes it is not 
necessary to recall all these details, we refer to the following condensed de- 
scription of a n-coalescent: 

Imagine n particles (blocks in a partition), which coalesce into a single 
particle within a random number of steps. This happens in the manner 
of a continuous time Markov chain. Namely, if there are currently m > 1 
particles, then they merge to / particles at a rate p m ^ with 1 < / < m — 1. 
Thus 

Pm Pm,l T* ' ' * "T pm,m—l 

is the total merging rate and 

D Pm.l i^i^ i 

■r m ,i = , 1 < Z < ra - 1 , 

Pm 

gives the probability of a jump from m to I. 

In these models the rates p m j have a specific consistency structure arising 
from the merging mechanism. As follows from Pitman [15] they are in general 
of the form 

p m , m -k + i = (™) J t k '\l - t) m ~ k A(dt) , 2 < k < m , 

where A(dt) is a finite measure on [0, 1]. The choice A = 5 corresponds to 
the original model due to Kingman [T2], then p m ^ = for I ^ m — 1. In this 
paper we assume 

Mdt) = w ^ m t i - a (i-tr- i dt, 

thus 

(TTl \ 
k JB(k-a,m-k + a) , 

where B(a, b) denotes the ordinary Beta-function. Then the underlying coa- 
lescent is called the Beta(2 — a, a)-coalescent. For a = 1 it is the Bolthausen- 
Sznitman coalescent [5] and the case a = 2 can be linked with Kingman's 
coalescent. 



The situation can be descriped as follows: There are the merging times 
= T < Ti < • • • < T Tn and there is the imbedded time discrete Markov 
chain n = Xq > X\ > ■ ■ ■ > X Tn = 1, where Xi is the number of particles 
(partition blocks) after i merging events. This Markov chain has transition 
probabilities P m> i and, given the event Xi = m with m > 1, the waiting time 
T i+ i — Ti to the next jump is exponential with expectation l/p m . Since a 
point process description is convenient later, we name the point process 

Tn — 1 

i=0 

on {2,3, . . .} the coalescent's point process downwards from n, shortly the 
CPP(n). 

This dynamics can be vizualised by a coalescent tree with a root and n 
leaves. The leaves are located at height T = and the root at height T Tn 
above. At height Tj there are Xi nodes representing the particles after % 
coalescing events. The total branch length of this tree is given by 

Tn — 1 

L n = YsMTi+x-Ti) . (2) 

i=0 

For 1 < a < 2 the asymptotic magnitude of L n is obtained by Berestycki 
et al in j2], it is proportional to n 2 ~ a . The asymptotic distribution of L n is 
easily derived for the Kingman coalescent, see [7], it is Gumbel. The case 
of a Bolthausen-Sznitman coalescent is treated by Drmota et al [8], here L n 
properly normalized is asymptotically stable. The case < a < 1 of a Beta- 
coalescent is contained in more general results of Mohle |T5]. Partial results 
for the Beta-coalescent with 1 < a < 2 have been obtained by Delmas et al 

®- 

In this paper we derive the asymptotic distribution of the Beta-coalescent 

for 1 < a < 2. Let <^ denote a real- valued random variable with a distribution, 
which is stable of index a and normalized by the properties 

E(q) = , Pfc > X) = 0(x~ a ) , P(q < -x) ~ X~ a (3) 

for x — > oo. Thus it is maximally skewed among the stable distributions of 
index a. 



Also let 



r(a)a(a — 1) T(a)a(a — l) lH 

Cl = 2^ ' ° 2 = r(2-a)£ 



Theorem 1. For the B eta- coales cent with 1 < a < 2: 
(i) If 1 < a < |(1 + \/5) (#ww 1 + a - a 2 > 0), then 

L n - c x n 2 ~ a d c 2 <, 



-)• 



X +l-a ' n _l_™_ ™2^ X 



If a = i(l + VS), &en 

L n - d n 2 ~ Q d 



(log n) i 

(iii) If |(1 + \/5) < a < 2, then 

L n - dn 2 ~ a -> r] , 
where rj is a non- degenerate random variable. 

In fact it is not difficult to see from the proof that rj has a density with 
respect to Lebesgue measure. 

This transition at the golden ratio |(1 + v5) gets manifest already in the 
results of Delmas et al [7] . They also show that the number r n of collisions, 
properly rescaled, has asymptotically a stable distribution. This latter result 
has been independently obtained by Gnedin and Yakubovitch [TTj . 

The region within the coalescent tree, where the random fluctuations of 
L n asymptotically arise, are different in the three cases. In case (i) fluctu- 
ations come from everywhere between root and leaves, whereas in case (iii) 
they mainly originate at the neighborhood of the root. Then we have to 
take care of those summands Xj(T i+1 — Tj) within L n , which have an index i 
close to r n . In the intermediate case (ii) the primary contribution stems from 
summands with index i such that r n — n 1 ^ 6 < i < r n — n £ with < e < ~. 

To get hold of the these fluctuations, in proving the theorem we loosely 
speaking turn around the order of summation in L n = X][=o -^"iC^i+i ~ ^)- 
We shall handle the reversed order by means of two point processes jjl and v 
on {2, 3, . . .}. The first one gives the asymptotic particle numbers seen from 
the root of the tree. Here we use Schweinsberg's result [17] implying that the 
Beta-coalescent comes down from infinity for 1 < a < 2, see [3], Corollary 3.2. 
(Therefore our method of proof does not apply to the case of the Bolthausen- 
Sznitman coalescent.) The second one is a classical stationary renewal point 



process, which can be reversed without difficulty. Two different couplings 
establish the links. Thereby the exponential holding times are left aside at 
first stage. In this respect our approach to the Beta-coalescent differs from 
others as in Birkner et al [I] or Berestyki et al pp. 

Coalescent trees are used as a model for the genealogical relationship of n 
individuals backwards to their most recent ancestor. Then one imagines that 
mutations are assigned to positions on the tree's branches in the manner of a 
Poisson point process with rate 9. Let S n be the number of these segregation 
sites (see [3], section 2.3.4). Given L n the distribution of S n is Poisson with 
mean 6L n . To get the asymptotic distribution one splits S n into parts: 

S n - 6 Cl n 2 - a = (S n - 6L n ) + 6(L n - cm 2 " ) . 

Since L n /c\n 2 ~ a converges to 1 in probability, the first summand is asymp- 
totically normal and also asymptotically independent from the second one. 
Its normalizing constant is (8L n )~2 ~ (9ci)~^n^~ l . Again there are two 
regimes: n 1- "^ = o(ii" +1 "°), iff a < y/2. Partial results are contained in 
Delmas et al [7]. We obtain: 

Corollary 2. Let ( denote a standard normal random variable, which is 
independent from <j. 

(i) If 1 < a < \/2 ; then 

S n - 9 Cl n 2 - a d 0c 2 <, 



-> 



n^ +1 - a {l + a-a 2 )t 

(ii) If a = a/2, then 

S n - 9 Cl n 2 ~ a d r—^ , 6c 2 <, 



n 1 ' 



A V^iC + 



a — a 2 )'' 



m 



If y/2 < a < 2, then 



S n -9 Cl n 2 - a d r— 

r^ ->■ V^ciC 

n 2 



This is the organisation of the paper: Section 2 contains an elementary 
coupling of two N- valued random variables. It is used in section 3, where we 
introduce and analyse coalescent's point processes, and in section 4, where 
we couple these point processes to stationary point processes. Section 5 
assembles two auxiliary results on sums of independent random variables. 
Finally the proof of Theorem [T] is given in section 6. 



2 A coupling 

In this section let the natural number m be fixed. We introduce a coupling of 
the transition probabilities P m j and a distribution, which does not depend on 
m. From the representation of the Beta-function by means of the T-function 
and its functional equation we have 

l m\ Y[k — a) Y(m — k + a) 

Pm,m-k+l = r(2-a)r(a) r(m) £j ( m - k)\ 

1 F(k — a) (m — k + 1) • • • m T(m + a) 



T(2-a 



) r (°) r(fc + 1) (m - k + a) ■ ■ ■ (m - 1 + a) T(m 



thus 



with 



m,m—k 



•1) 


(m — k + a 


) ■ ■ ■ (rn 


-1 + a) 




r(* + i 

mfc T(A; + 


— a) 
2) ' 


k> 1 




(m — k) ■ 


• • (rn — 


1) 



" (m + a — k — 1) • • • (m + a — 2) 

and a normalizing constant d m > (also dependent on a). Recall from the 
introduction that given X Q = m the quantities P m ,m-k are the weights of 
the distribution of the downward jump U = X — Xi, For a more detailed 
discussion of this 'law of first jump' we refer to Delmas et al [?]. 

It is natural to relate this distribution to the distribution of some random 
variable V with values in N and distribution given by 

P(V = k)= ° ^M 1 "^ , *>1. (4) 

V ; r(2-a) T(fc + 2) ' - v ; 

This kind of distribution appears for Beta-coalescents already in Berestycki 
et al [1] (in the context of frequency spectra) as well as in Delmas et al 
[7] . There the normalizing constant is determined and the following formulas 
derived: 

E0O = ^- and P(V>k)= X /^tlT^ - ( 5 ) 

a— I r(2 — a) T(k + 1) 

From Stirling's approximation 

^^-FCT^j*""" 1 alld P(y ^ k) ~ r(2^j r ° ■ < 6 > 



The sequence d m k is decreasing in k for fixed m, thus the same is true for 
Pm,m-k/P(y = k). Therefore V stochastically dominates the jump size U, 
that is for all k > 1 

P(U >k\X = m)< P(V > fe) . (7) 

We like to investigate a coupling of U and V, where U < V a,.s. It is fairly 
obvious that this can be achieved in such a way that 

p(U = k\V = k) = lA gpgy = lA^f. (8) 

(Indeed one may put 

v(u - 1 1 v - k) - (i Pm ^ k Y {Pm ^ - P{v = J))+ 

1 J ' ; "V P{V = k)J P(U < km) - P(V < fc m ) 

for j y^ k with fc m = min{fc > 1 : P m ,m-k < P(V = A;)}. There are other 
possibilities, later it will be only important that we commit to one of them.) 

Lemma 3. For a coupling (U, V) fulfilling (jSJ) it holds 

P(U 4V)< l —— and P(V >k\U^V)< ck l ~ a . 

(a — l)m 

for all k > 1 and some c < oo, which does not depend on m. 

Proof. Because of a < 2 

(m — k) ■ ■ ■ (m — 1) (m — k) ■ ■ ■ (m — 1) m — k 



(m + a — k — 1) ■ ■ ■ (m + a — 2) (m — k + 1) ■ • -m m 

and because of a > 1 

(m + a — k — 1) • • • (m + a — 2) /m + a — l\ k n — 1 
> ( ) > 1 + k , 



(m — k) ■ • • (m — 1) V m ) m 

consequently 

k a m ]~ i 



m d m i + ( a -i)A 



It follows 

A;\_ d 



(l - -)P(V = fc) < ^P m , m - fc < P(V = k) 

\ 171/ dm 



for all k > 1 with d = a/T(2 — a). Summing over k yields 
l_l E (y)<A<i or !<%< ! 



m d m d (1 — 7 — \y- v 

'" \ (a— ljm' 



Combining the estimates we end up with 



1 _ k_ < drnk < 1 (g) 

m~ d - (l + ( a _l)A)(i_ i_)+ K) 

v v / m' V (a— ljm' 



for all fe > 1. 

Now from ©, Q 



P(^ ^ V) = ^( P (^ = fc ) - P m,m- fe )" 



fc>l 



fe>i fc>i 

thus from (jHJ) 

P(^ 7^ V) < \ 

(a — l)m 

which is our first claim. 

Also, letting m > 2/ (a - 1) and fc' = 2 [(a - l)" 2 + (a - l)" 1 ] then 

(1 + (a - 1)^)(1 - r^) + = 1 + (a - 1)^ - ^-W- - -^ 

> l + ^p-^-r-^- >l + i . 

— 2 m (a— ljm — m 

From (|SJ) 

dmk' ,1 1 
1 — > 1 > 

d ~ 1 + i ~ 2m 

m 

and from (jHJ) 
P(C/ ^ F) > P(£7 ^ A;', V = k') = (1 - i*!)+P(y = A;') > — P(V = k') 



for m > 2 /(a — 1). It follows that there is a 77 > such that for all m > 1 

P(C/^1/) > — . 

8 



Now from (JHD and © 



P(V = k \U^V) 



P(U ± V) 



^mk 



1 , , 

" ; .p(y = jfc) < ^(V = jfe) 



P(tf^V) 
and the second claim follows from (jHJ). □ 

3 The coalescent's point process 

Let \i denote a point process on {2, 3, . . .}. For any interval I let /// be the 
point process on {2, 3, . . .} given by 

fn(B) = fi(B H I) , 5C {2,3,...}. 

We call fj, a coalescent's point process downwards from oo, shortly a 
CPP(oo), if the following properties hold: 

• /i({2, 3, ...}) = oo and /i({n}) = or 1 for any n > 2 a.s. 

• For n > 2 we have: Given the event /i({n}) = 1 and given /i[ n +i,oo) the 
point process /X[2,n] is a CPP(n) a.s. 

Recall that a point process is called a CPP(n), if it can be represented as in 
©• 

Theorem 4. Let 1 < a < 2. TTien iTie CPP(oo) exists and is unique in 
distribution. 

We prepare the proof by two lemmas. 

Lemma 5. Let [i be a CPP(n) with 1 < n < oo. T/ien /or any e > £/iere is 
a natural number r such that for any interval I = [a, b] with 2 < a < b < n 
and b — a > r we have 

P(Ai(J) = 0) < e . 

Proof. For I = [a,b] 

n 

Oz(I) = 0}= |J {ft({m}) = l,(JL([a,m-l]) = 0}&.s., 

m=b+l 



since /x({n}) = 1 for n < oo and /x({2, 3, ...}) = oo a.s. for n = oo. Thus 
from (TfJ) 

n 

P(/i(J) = 0) < J^ P(X X < a | X = m) . 

m=b+l 

Applying ((7j) to U = X — Xi it follows that 

n oo 

P( y u(/)=0)< J^ P(K>m-a) < J2 P ( V > h ~ a + k ) ■ ( 10 ) 

m=b+l k=l 

Since E(V) < oo, this series is convergent and the claim follows. □ 

The next lemma prepares coupling of CPPs. 

Lemma 6. Let fi, fi' be two independent CPPs coming down from n, n' < oo. 
Then for any e > there is a natural number s such that for any b sufficiently 
large and n,n' > b we have 

p (M{i}) =/AO'}) = 1 for some j = b - s, . . . , b]) > 1 - e . 

Proof. First let n < oo. We construct a coupling of a CCP(n) ji to an 
i.i.d. random sequence. Consider random variables U\, V\, U2, V2, ■ ■ ■ and 

n — Xq , X\ , . . . with Xi — n — U\ Ui, which are constructed inductively 

as follows: If Ux, Vi, . . . , Ui, Vj are already gotten, then given the values of 
these random variables let VJ +1 be a copy of the random variable V from 
section 2 and couple f/j + i to V^+i as in section 2, with m = X^. For defmitness 
put Ui + \ = if Xi = 1. Then Vi,V2, ■ ■ ■ are i.i.d. random variables with 
distribution (Jl]) and X > Xi > . . . > X Tn _i are the points of a CPP(n) fi 
down from n, where r n is the natural number i such that X« = 1 for the first 
time. 

Now let A; be a natural number. Then Xj_i > n — JJ\ — ■ ■ ■ — Uk for i < k. 
Thus for any n > and n > 6/c?7 _1 E(V) + 2 from Lemma |3] 

P(Ui ^ Vi for some i < k, U x + • ■ ■ + U k < Qkn^EiV)) 

<tmt K, A Vl > » - 6*,-'E(V0) < (a . 1)(B _* 6fa ,- 1E(y)) . 

thus 

Pfe ^ Vi for some i < k, U x H h U k < 6A;77 _1 E(F)) < - 

6 

10 



if n is large enough. Also E(CZj) < E(V") because of ([7]), thus from Markov's 
inequality 

P(l7i + • • • + U k > Gk^EiV)) < | . (11) 

and consequently 

P(Ui 7^ Vi for some i <k) < — 

if n is sufficiently large (depending on n and k). 

Next let I be a natural number and n' — n + l. Let U[, V[, U' 2 , V 2 , . . . and 
n' = Xq,X[, ... an analogue construction with random variables, which are 
independent of Ui, Vi, U2, V2, . . . Then also 

P(U( ^ V- for some i < k) < 1 . 

Moreover because V has finite expectation and because of independence from 
classical results on recurrent random walks 



P (S> ± E v i - l for a11 3< k) < 



1=1 1=1 



if only k is sufficiently large (depending on I). Combining the estimates we 

obtain 

3 3 



5r/ 



P (E [/ ^ 12 u i - l for a11 3< k ) < 

i=l i=l 

For the corresponding independent CPPs [i and \J coming down from n and 
n' = n + l this implies together with ( TTT1) 

P(//({j}) = /i'({j}) = 1 for some j G [n - 6A;^ 1 E(V A ),n]) > 1 — 77 . 

Leaving aside the coupling procedure we have proved: Let 77 > 0, let / be 
a natural number and let \i and \j! denote independent CPPs coming down 
from n < 00 and n' — n + 1. Then there is a natural number r' such that 

P (MO'}) = At'CO'}) = X for some 3 =n-r',...,n) > 1 - 77 , (12) 

if only n is large enough. 
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With this preparation we come to the proof of the lemma. Let e > 0, 
b > 2 and let n, n' > b. Denote 

M = max {k < b : fi({k}) = 1} , M' = max {fc < b : //({&}) = 1} 

(with the convention M = 1, if /x([2, 6]) = 0). From Lemma [5] 

P(M,M' G [6-r,6]) > 1-- 

for some r and 6 > r + 2. Then 

P(M0» = M'(0'}) = 1 ^ no je[b-r'- r, 6]) 
< - + P(/i({j}) = /i'({j}) = 1 for no j e[b-r' -r,b};b-r < M, M' < b) 

< £ - + 2 J2 P (M{J'}) = ^({J}) = 1 

b—r<m<m' <b 

for no j = m — r', . . . ,m \ Xq = m, X' = m') 

From (TT21) it follows that the right-hand probabilities are bounded by r\ = 
e/4r 2 , if b is only sufficiently large. Then 

P(M{j}) = /AO'}) = 1 for no j e[b-r' - r, b}) < s , 

which is our claim with s = r + r'. D 

As a corollary we note: 

Lemma 7. Let // and // 6e two independent CPP(oo). Then a.s. /x({i}) = 
/i'({j}) = 1 /or infinitely many j e N. 

Proof. From the preceding lemma there are numbers numbers &i < 62 < ■ • • 
such that 

P(/i({j}) = //'({j}) = 1 for no j = b k , ... , 6*+i]) < 2~ k . 

Now an application of the Borel-Cantelli Lemma gives the claim. □ 

Proof of Theorem ^ The existence follows from the fact that for a > 1 the 
corresponding Beta-coalescent (ILjt>o comes down from infinity [17], which 
means that the number of blocks in Yl t is a finite number N t for each t > 0. 
Put n({k}) = l,iSN t = k for some t > 0. 

Uniqueness follows from the last lemma and a standard coupling argu- 
ment. □ 

12 



4 A bigger coupling 

Now let v a stationary renewal point process on {2,3,...}, that is, if we 
denote the points of v by 2 < R\ < i? 2 < • • • , then the increments R4+1 — Ri 
are independent for i > (with R$ = 1) and Ri+i — Ri has for i > 1 the 
distribution (B|. A stationary version of the process exists, since E(V) < 00, 
such that the distribution of R\ may be adjusted in the usual way to obtain 
stationarity: 

Pfoi = = l ~ {v) ] , r = 2,3,... (13) 

Stationarity is of advantage for us. Then v may be considered as restriction 
of a stationary point process on Z. Such a process is invariant in distribution 
under the transformation z 1— > Zq — z, z e Z with z ^ ^- Therefore u, 
restricted to {2, . . . , n} looks the same, when considered upwards or down- 
wards. 

In this section we introduce a coupling between v and the CPP(oo) /i, 
which allows us later to replace \x by v. Given 6 > 2 let as above 

M = max{k < b : fi({k}) = 1} , M' = max{k < b : u({k}) = 1} . 

Again, if there is no k < b such that //({&}) = 1, we put M = 1, and similary 
for M' . Let Xf, and X' b denote the distributions of M and M' (both dependent 
on b). 

Now for r G N we consider the following construction of \x and v, restricted 
to [T- 1 + l,2 r ]. Take any coupling (M,M') of X 2 r and A' 2r . Given (M,M') 
construct random variables Ux, V\, U2, V2, . . . inductively as in the proof of 
Lemma [6], using the coupling of section 2. Here we start with Xq = M. Also 
let Y = M', 

X i = M-U 1 Ui, Y i = M'-V 1 Vi, i>l, (14) 

and 

iV = min{i> 0:X, <2 r '- 1 } , N' = mm{i > : Y t < 2 1 "- 1 } . (15) 

The whole construction is interrupted at the moment iV V N'. Maybe 
M,M' < 2 r_1 , then no step of the construction is required. Clearly the 
following statements are true: 

13 



• The point process J2 i=0 Sx t is equal in distribution to //, restricted to 

• The point process J2 i=0 <% is equal in distribution to v, restricted to 

[ 2 r-l + lj2 r ]. 

• Xjy and Y/v V 1 have the distributions A 2 r-i and A^r-i- 
The complete coupling is 

JV-l AT'-l 

^(m^o = (x^*> s ^ a,^, v x ) ( ie ) 

j=0 i=0 

= (01,02,035 04) ( Sa y) • 

Its distribution is uniquely determined by the distribution of the coupling 
(U, V) from section 2. The following continuity property is obvious: 



If we have a sequence (M n , M' n ) of couplings of A 2 r and A' 2 r such that 
and 



(M n ,M' n ) A (M,M'), then (M,M') is also a coupling of A 2 r and A' 2 



&\M n ,M' n ) A$ r (M,M') . 

Another obvious fact is that this construction can be iterated: Given 
Q r (M,M') we construct $ r_1 (03, 0^) and so forth. Thus starting with the 
independent coupling (M, M') (i.e. M and M' are independent) we obtain 
the tupel 

* r = ($ 1 > r (M 1>r , M( jr ), $ 2 ' r (M 2 , r , M^ r ), . . . , ^(M r , r , M; >r )) , 

where (M r , r , M^) = (M, M') and (M s>r , M^ r ) = ($ +1,r , 0f 1,r ) for s < r. 
Since M sr and M' s r are no longer independent in general, the tupels \l/ r are 
initially not consistent for different r. To enforce consistency note that for 
fixed s the distributions of (M s>r , M' s r ) are tight for r > s, since they take 
values in the finite set {1, ... ,2 s } x {1, ... ,2 s }. Thus by a diagonalisation 
argument we may obtain a sequence 1 < r\ < r 2 < • • • such that 

(m v „ ) m s ,j4(m S)CO ,m; ) J 

for certain couplings (M St00 ,M' soo ) of A 2 s and A 2S . 
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If we use instead of the independent coupling (M, M') now (M roo , M' roo ) 
as starting configuration in the construction of ^ r , then we gain consistency 
in the sence that 

v^-i £ ($V(M 1>r , M[ tr ), $ 2 ' r (M 2 , r , M^ r ), ..., <r-^(M r _ lir , M;_ 1>r )) . 

Proceeding to the projective limit we obtain the 'big coupling' 

y°° = ($ 1 ' 00 (M lj00 , M[ >0O ), $ 2 '°°(M 2i0O , M^J, . . . ) • (17) 

It has the property that 

oo oo 

/ i = 5Z^l'°° aIld Z/ = 5Z^2°° ( 18 ) 

r=l r=l 

are coupled copies of our CPP(oo) and stationary point process. 

In order to estimate the difference between both point processes we go 
back to (TH1) . (TT5]) and estimate the tail of the distribution of 

D r = max \Xi-YA . (19) 

i<NAN' 

Lemma 8. There is a constant c > such that for all r > 1 and all t > 

P(A- > t) < c£^ Q 
Proof. For i < N A N' we have 

|^<-ii|< $^ |£/,--^| + l*o-io| 

j<NAN' 

< J2 \Uj - V^l + (2 r - M) + {2 r - M') . (20) 

j<NAN' 

From ©, (flDl) 

P(2 r -M>i) < ^P(^ > k) < ct x ~ a (21) 

for a suitable c > 0. 
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Because of stationarity 2 r — M' and (R\ — 2) A (2 r — 1) are equal in 
distribution, therefore because of flS]), (TT51) 

P(2 r - M' > t) < (Ri >t)< ct l - a (22) 

for a suitable c > 0. 

Finally from Lemma [Hlf/j 7^ V,- occurs for j < N at most with probability 
p _ 2 1 ~ r /[a — 1) and then \Uj — Vj\ < Vj a.s. Also because of Lemma [3] 
these Vj can be stochastically dominated by random variables a + bQ with 
constants a, 6 > and positive i.i.d. random variables Q, which possess a 
stable distribution of index a — 1 and Laplace transform exp(— A" -1 ). Also 
NAN' < 2 r ~ 1 = w (say). Thus ^2 j<NAN i \Uj — Vj\ is stochastically dominated 
by the random variable 

w 
j=0 

where Ij are i.i.d. Bernoulli with success probability p. Let ^p(X) = 
exp(— aX — (5A) Q_1 ) be the Laplace transform of a + bQ. Then W has the 
Laplace transform 

a(X) = {1 - p(l - V (X))) W 

It follows 1 — er(A) < wp(l — (p(X)) < (1 — (p(X))/(a — 1). From the well-known 
identity A / °° e" Aa: P(iy > x) dx = 1 - cr(A) it follows that 

e - 1 P(py > t) < r 1 / e - x,/ 'P(iy > x) dx 

Jo 
= 1 - a(l/t) < ^(1 - exp(-ar 1 - (to) 1 -")) . 

Thus 

P ( 5Z l^i ~~ F jI > *) - P ^ > *) - ctl_a ( 23 ) 

j<NAN' 

for a suitable c > 0. Using the estimates (J2T|) to ( 1231) in (12"U|) yields our 
claim. D 

Additionally we note that 

\N-N'\<D r . (24) 

Indeed, if iV < N', then A w < 2 r ~\ thus Fjv < 2 r ~ l + D r . Further Y^^ > 
2 T -\ which implies JV' - 1 - N < Y N - Fjv'-i < A ■ - 1- The case N' < N 
is treated in the same way. 
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5 On sums of independent random variables 

The following lemma can be deduced from well-known results (see f.e. Petrov 
[H]), but a direct proof seems more convenient. Let 

1 

7 = 7 • 

a — 1 

Lemma 9. Let 14, V 2 , . . . be i.i.d. copies of the random variable (|IJ). Then 
for any (3 e R and any e > a.s. 

n 

5>^(V fe - 7) = ^ + o(n^ +£ ) , 
fe=i 

where r] n is a.s. convergent. 

Proof. Let e > 0. A short calculation gives that E(V fc 2 ; V*. < /c~ +e ) is of order 

fe 2-l + (2-a) Cj thug 

oo 

^ A;-^ £ (\41 yfc<fc i +£ - E(\4; V fc < fc=+-)) 
fc=i 

is a.s. convergent. Also E(Vfc; Vfe > A> +£ ) is of order less than fc« _1 and 
P(Vfc > A> +£ ) is of order A; _1_ae such that 14 > /c~ +e occurs only finitely 
often a.s. Thus 

oo 

fc=l 
is a.s. convergent for all e > 0. 

For /3 > - it follows that the sum Y^l=\ ^ ~ {Yk ~ l) is a - s - convergent, 
which is our claim (then the term o{n^~^ +e ) is superfluous). In the case 
(3 < - by Kronecker's Lemma a.s. 



X>^(14- 7 ) = o(n^ +e ), 



fc=i 



which again is our claim (now r\ n is superfluous). □ 

Next recall that q denotes a random variable with maximally skewed stable 
distribution of index a as in ()3]). The following result can be deduced from a 
general statement on triangular arrays of independent random variables (see 
[TU] . chapter XVII, section 7), however a direct proof seems easier. 

17 



Lemma 10. Let Vy, V%, . . . be independent copies of the random variable 
Then the following holds true: 

(i) Let 1 < a < |(1 + V5). Then 

n 

k=i 

where 

c= ((l + a-a 2 )T(2-a)y« . 

(ii) For a = i(l + V5) 

n 

(logn)-^X;^" a (^-7) 



— > 



fe=i 



r(2-a) 



Proof, (i): From (jSJ), (EI) and the theory of stable laws it follows that 
n~ s (Vi H h K - 7n) ->■ 



r(2-a)a 



We express this relation by means of the characteristic functions <p(u) and 
e^ofK-7and-?/r(2-a) 1 /°: ^(n~^M) n -> e^ for all u G R or slightly 
more general 

<p(v n n-^u) n -> e^ (u) , 

if f n — )■ 1. Since (p n (u) = <p(v n n~"u) is again a characteristic function, it 
follows from Feller [TDj, chapter XVII. 1 Theorem 1, that for n — > oo 

n{tp{y n n~au) — !)—>■ ^(it) 



or 



<^(sw) — 1 ~ s a ip{u) , as s — t- 

for all real it. Since a — a 2 > — 1 for a < |(1 + Vo), it follows that with 
£ = (l + a - a 2 )^ 



fc=l ^ 



a 1 - 



*«)E (£5?)* ->*«)■ 



fc=l ^ a 
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Similarly 



and consequently 



£ 

fc=i 



<P 



(k 



l-Q 



n 1 -^^ 



r « -1 



-)• WW 



fc=l 



(A- 



l-a 



Ekbi^rx M - 1 



ji- q+ -« 



< max 

k=l,...,n 



V 



(k 



l-Q 



n 1 "" 1 a 



-?/, 



1 > \<p\ r u — 1 



fc=l 



^0 



for n — > oo. 

In order to transfer these limit results to characteristic functions we use 
that for all complex numbers z with \z\ < 1 



\z-e z - x \ <c\z-l\ 2 



for some c > 0. Therefore, if |^i|, . . . , \z n \ < 1, 



zi • • • z„ - e 



(«i-i)h— •+(*„- 



1} | < 5Z l^fc - e**" 1 ! < c^ |^ fc - 1| 



fc=i 



fc=i 



We put Zfc = Zkn(u) = (p(-^—— T u). Then the right-hand side goes to zero 
and we obtain 



z ln (u) ■ ■ ■ z nn (u) -^e 



ip(u) 



Since the product on the left-hand side is the characteristic function of 
£ n a-i-- ^™ =i A; 1_a (\4 — -^zi) the claim follows, 
(ii): This proof goes along the same lines using 



Now a — d 2 = — 1, thus 

A / / A; 1 "" 

and the claim follows. 



n 

i)~vk«)x;( 



,1— a 



i v (logn)^ 



1 n i 

l) ~^(u), V- ~^( 

/ logn -^ k 

k=\ 



U 



D 
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6 Proof of Theorem 1 

Again let 2 < i?i < i?2 < • • • be the points of the stationary point process v 
and denote 

Vj = R j+1 - Rj , j > 1 . 

The random variables Vx, V 2 , . . . are i.i.d. with distribution (TJJ. 
Lemma 11. We have 

[ X l ~ a U(dx) = f ^ - T* E fcl ""(^ ~ 7) + $n 

J\2,n] 7(2 - a) f-i 



' [2,n 

with 



— ? 



#n = Vn + Op(n^ +1 a+£ ) 



for any e > 0, where the random variables rj n are convergent in probability. 
Proof. Our starting point is 



/ X 1 "" v(dx) = J2 R] 



a 



where r n is such that R Tn <n< R Tn +i- From Lemma [9] we have R n — jn 
o{n* +e ) a.s., which implies r n — - = o{n^ +£ ) a.s. 
By a Taylor expansion 



R\- a = M 1 -* + (i - ")(7*r a (^ - 7^ + % 



i-1 

= (7Z) 1 "" + (1 - a)( T i)- Q Y,(V S - 7) + <5? (25) 

where the remainder is a.s. of the order 

8" = 0(C a -\Ri - 7 *) 2 ) + 0(i~ a ) = o^' a - l+£ ) . 
We consider now the sums of the different terms in fl25|) : 

r n 1 — Q 

z — ' z — a 
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where T)' n is a.s. convergent. Further, putting a n = (a — 1) J2i> n ^ a -> 



a - «) E »*~° E^ - t) = a - «) E(^ - 7) E r ° 

»=1 i=l 3=1 »=J+1 

= a r „(^r- n +l - -Rl - 7?"n) - E fl ^^ ~ 7 ) • 

i=i 

The distribution of -R r „+i — ft does not depend on n because of stationarity, 
thus a rn (R rn+1 - R x - n) = Opin 1 ^). Also E?=i(S " j 1-a )(^ " 7) is 
a.s. convergent for a > 1, since a n — n 1_a = 0(n _a ) and since V has finite 
expectation. It follows 

r n i—1 



:i-«)E ra E(^--7) 



i=i 



r* a (n — 7r ; 



- E J 1 -"^ - 7) + ^ + Op^) , (27) 
i=i 



where 77" is a.s. convergent. Next 

E^ = ^+°(^" Q+£ ) a - s - ( 28 ) 

for all £ > 0, where rj'^ is a.s. convergent. Note that this formula covers two 
cases: If - < a, then the sum is a.s. convergent and the right-hand term is 
superfluous. Otherwise the term r/'" can be neglected. 
Furthermore another Taylor expansion gives 

= ^ n > + ( 7 r n ) 1 ~ a (n - 7 r n ) + o(n«~ a+£ ) a.s. (29) 

2 — a 2 — a 

Combining (|25|) to ( 129]) gives 

E ^ = 7^—) ~ 7- a E^^ - 7) + *7n + o(n^« +£ ) a.s. , (30) 
where rj n is convergent in probability. 
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Finally we consider the (loosely notated) difference 



«/7 r„ 

E ?-*& - i) = E ^""to - 7) - E-^to - 7) • 

j=r„+l i<n/7 j'=l 

For any random sequence of natural numbers s n such that s n = o{n~^ +e ) a.s. 
for all e > 

E(^ - 7) = Rsn+x -Rx- is n = o(sl +£ ) = o(n^ +2£+£2 ) a.s. 

i<S n 

Since r n — n/ r y = o(ria +£ ) a.s. for any e > 0, this implies for any e > in 
probability 

n/7 

This implies "Y^j= r +i(^j + 7) = 0p(riQ +£ ). Therefore 

n/7 

n/7 n/7 

<^| E (^-7)| + |(?) 1 "°'-ri- a | E to +7) 

j=r„+l j=r n +l 

' -^ +1 - Q+£ ) + 0(n- a (n - 7r„,))o P (n^ +£ ) 



op(n< 



<>rin^ +1 ~ a+£ ) + o P (n« 



— -'-"-- ! _1 ,,,,( ,,-- Q+2e ) 



Since -^ + 1 > -, we end up with 



n/7 



E 3 1 - a (y J ->y) = o P (n* +1 -~«) 



j=r n +l 



Combining this estimate with fl30|) gives the claim. 



D 
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Proof of Theorem Ql The total length (J2J) of the n-coalescent can be rewritten 



as 

Up* 

where E Q ,Ei,... denote exponential random variables with expectation 1, 
independent among themselves and from the JQ. 

From Lemma 2.2 in Delmas et al [7] we have for m — > oo 

Pm = ^m a + 0(m a - 1 ). (31) 

In the first step we replace the points n = X > X i > • • • of a CPP(n) 
by points of a CPP(oo): If we take independent versions of both then for 
given e > by Lemma M there is a natural number s > 1 such that with 
probability at least 1 — e they meet before n — s. From this moment both 
CPPs can be coupled. Thus, letting n > X' Q > X[ > ■ ■ ■ be the points of the 
coupled CPP(oo) within [2, n], independent of E , Ei, . . ., and 

r'-l 

j = KA i 



then due to the the coupling and fl3T|) for n sufficiently big 

P(|L n - L' n \ > ZaT{a)n l - a {E + ■■■ + E,)) < e . 

Since a > 1, L n — L' n — op(l), thus we may replace L n by L' n in our asymp- 
totic considerations. 

Thus we work now with a CPP(oo) ft, which we couple to a stationary 
point process v according to f[T7|) and (fTg|) . Also let E , E 1 , . . . be independent 
of the whole coupling. We use the formula 

L' n = [ ^n(dx), (32) 

J[2,n] Px 

in which the exponential random variables now are ordered differently. Since 
J2 x >i x ~ a Ex < oo a.s., it follows from f l3T|) that 



L' n = aT(a) / -^ n(dx) + r)i, n 

J\2,n] x 



[2,n] 
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where T)i >n is a.s. convergent. 

Next ^2 x>2 x ~ l ^ 2 ~ £ {Ex — 1) is a.s. convergent for any e > 0. It follows 
that J2 x >2 xl ~ a (Ex — 1) is a.s. convergent for a > § and else a.s. of order 
0(n2~ a+£ ). Given /i the same holds true for JL, x %zl n(dx), thus 



L' n = aT(a) f x l - a fx(dx) + r] 2 . n + o(n 3 ^- a+£ ) 



a.s. 



where again rj2 yn is a.s. convergent. 
Next from JTHj) with 2 s < n < 2 S+1 



x 1 a fi(dx) = / x 1 a v{dx) 

[2,n] ^[2,n] 



+ £ 



r=l 



x a (0i°°(dx) — 02°°(^ a; )) 



/ x 1 -" (#*°°(daO - $ ,0 °(efc)) 



'[2 s + l,n] 

From ([19]) and ([24]) we see that 



[2 r - 1 +l > 2 r 



x^ a (<ft°°(dx) - f 2 '°°(dx)) 



<2 r - L (a-l)(2 r - L )- a D r + 2(2 



■tr— 1\1— a 



D r 



and the same estimate holds for the last term above. In view of Lemma [8] 
and the Borel-Cantelli Lemma we conclude that 



x a fi(dx) = / x a v(dx) + rj 3: , 

[2,n] J[2,n] 



with 7]3 >n a.s. convergent. Altogether 

L' n = aT(a) f x^ a v(dx) + r/ 4 ,„ + o(n 3/2 ~ a+£ ) 

J[2,n] 

where 7]^ n is a.s. convergent. Finally Lemma [Til gives a.s. 



-/ _ r(a)a(a-l)^2- 



(2-a) 



n 



T(a)a(a-irJ2 kl ~ a ( V k-l) 



fe<s 



+ r/ n + o P {n^ +l ' a+£ ) + o(n 3 / 2 " a+ ^ 



(33) 
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for all e > 0, where r\ n now is convergent in probability. 
We are ready to treat the different cases of Theorem [TJ 
If 1 < a < (1 + v / 5)/2, then we use that 1/a > 1/a 2 and 1/a > 1/2. 

Therefore the three remainder terms in (1331) are all of order op{n^ +1 ~ a ) and 

thus may be neglected. The result follows from an application of Lemma fTOl 

The case a = (1 + Vo)/2 is treated in the same way. 

If a > (1 + a/5)/2, then ^ + 1 — a < and 3/2 -a < 0. Also from Lemma 

[9] it follows that J2 k< n. & 1_a (Vfc — 7) is a.s. convergent. Thus it follows from 

— 1 

023]) that L' n (2~a) n2 ~ a is convergent in probability. To see that the 

limit of L' n (and thus L n ) is nondegenerate, we go back to ( 1321) resp. 



T i r(a)a(a-l) 2- 
u n (2-a) n 






r(a)a(q-l) 2- 
(2-a) '* 

[3,n] " 



As shown the term in brackets in convergent in probability. Also fi({2}) = 1 
with positive probability. Since the exponential variable E 2 is independent 
from the rest on the right-hand side, the whole limit has to be non-degenerate. 
This finishes the proof. □ 
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